Distribution of components in the /c-nearest neighbour 
random geometric graph for k below the connectivity 

threshold 

Victor Falgas-Ravry * 

1 November 29, 2012 

O' 

> 

Abstract 

OO 
(N 

5? 

I of Walters 13 1. As an application of our result, we show that the distribution of 

the connected components of S n ,k below the connectivity threshold is asymptotically 
Poisson. Similar results were independently obtained by Balister. 

■ 1 Introduction 

In this paper, we study the distribution of small connected components below the con- 
nectivity threshold in the /c-nearest neighbour model. We prove that they lie far apart, 
and are approximately distributed like a Poisson process in both a numerical and a spatial 
sense. 



(N 



Let S n ^k denote the random geometric graph obtained by placing points inside a 
square of area n according to a Poisson point process of intensity 1 and joining each 
such point to the k — k(n) points of the process nearest to it. In this paper we show 
that if V(S n ,k connected) > n~ 71 then the probability that «SVi,fc contains a pair of 
'small' components 'close' to each other is o(n~ Cl ) (in a precise sense of 'small' and 
'close'), for some absolute constants 71 > and c\ > 0. This answers a question 



1.1 Definitions 



Let S n denote the square [0, ^/n\ 2 of area n, and let || • || denote the usual Euclidean norm 



in M 2 . 



Let k = k(n) be a nonnegative integer parameter. Scatter points at random inside S n 
according to a Poisson process of intensity 1. (Or equivalently, scatter N ~ Poisson(n) 
points uniformly at random inside S n .) This gives us a random geometric vertex set V. 
The k-nearest neighbour graph on V, S n ^ = S nt k(V), is obtained by setting an undirected 
edge between every vertex of V and the k vertices of V nearest to it. 

The Poisson law of the random pointset V coupled with our deterministic definitions 
of S n k gives rise to a probability measure on the space of /c-nearest neighbour graphs with 
vertices in S n , which we refer to as the k-nearest neighbour model S n 
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As this model has some inherent randomness, rather than proving deterministic state- 
ments about S n> k we are mostly interested in establishing that properties hold for 'typical' 
fe-nearest neighbour graphs. Formally, given a property Q of A;-nearest neighbour graphs 
(i.e a sequence of subsets Q n of the set of all geometric graphs on S n ) and a sequence 
k = k(n) of nonnegative integers, we say that S n ^(n) has property Q with high probability 
(whp) if 

lim nS nMn) e Q n ) = 1. 
1.2 Previous work on the k-nearest neighbour model 

An important motivation for the study of the connectivity properties of the /c-nearest 
neighbour model comes from the theory of ad-hoc wireless networks: suppose we have 
some radio transmitters (nodes) spread out over a large area and wishing to communicate 
using multiple hops, and that each transmitter can adjust its range so as to ensure two- 
way radio contact with the k nodes nearest to it. The connectivity of this radio network 
is modelled using S n k in a natural way. As a result, the model has received a significant 
amount of attention, though many questions remain. (See e.g. the recent survey paper of 
Walters 



12]. 



Elementary arguments show that there exist constants q < c u such that for k(n) < 
c/logn S n> h is whp not connected, while for k(n) > c u logn S n k is whp connected. A 



bound of Cu < 5.1774 was obtained by Xue and Kumar [14J, using a substantial result of 
Penrose [lj| for the related Gilbert disc model 0]. An earlier bound of c u < 3.5897 could 



also be read out of earlier work of Gonzales-Barrios and Quiroz [9]. These results were 
substantially improved by Balister, Bollobas, Sarkar and Walters in a series of papers 0, 
0, 0| in which they established inter alia the existence of a critical constant c*: 0.3043 < 
c* < 0.5139 such that for c < c* and k < clogn, S n ^ is whp not connected while for 
c > c* and k > clogn, SVi.fc is whp connected. Building on their work, Walters and the 
author [3] recently proved that the transition from whp not connected to whp connected 
was sharp in k: there is an absolute constant C > such that if S Uj k is connected with 
probability at least e > and n is sufficiently large, then for k' > k + Clog(l/e), S n ^' is 
connected with probability at least 1 — e. 

As part of their results, Balister, Bollobas, Sarkar and Walters 0] showed that when 
0.3 log n < k < 0.6 log n whp all of the following hold: 

(i) all edges have length 0(vTog~n), 

(ii) there is a unique 'giant' connected component, 



(iii) all other components have diameter 0{y/\og n). 

(Strictly speaking, they only showed that S n f. has at most one giant component; that 
such a component exists follows from the study of percolation in the fc-nearest neighbour 
graph: see e.g. [2|.) 



Refining their techniques, Walters 13| showed that around the connectivity threshold, 



there were no 'small' components (of diameter O(ylogn)) lying 'close' to the boundary 
of S n (within distance O(logn)), and used this to improve the upper bound on c* to 



0.4125. Towards the end of his paper [131 ] . Walters asked a number of questions about the 
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properties of small components of S n k below the connectivity threshold, with the aim of 
completing the picture we currently have and perhaps improving the upper bound on c*. 

1.3 Results of the paper 

In this paper we contribute to this project by proving that the 'small' components are 
whp far apart (or more precisely, not 'close' together) and that they are distributed like a 
Poisson point process inside S n . This answers one of Walters's questions. 

To state our results formally, we must ascribe a precise meaning to 'small' and 'close'. 
We recall a result of Balister, Bollobas, Sarkar and Walters to do this. 

Lemma 1 (Lemma 1 of 0]). For any fixed a\,a2 with < a\ < a2 and any j3 > 0, 
there exists A = X(a±,a2, 0) > 0, depending only on a%,a2 and (3, such that for any k 
with ailogn < k < 0:2 log n, the probability that S n ^k contains two components each of 
diameter at least A^log n or any edge of length at least AyTogn is 0(n~ 13 ). 

Definition 1. Let A = max(A(0.3, 0.6, 2), e 2 ). A component shall be deemed small if its 
has diameter less than \^f\ogn. Two small components shall be deemed close if they 
contain two points lying at a distance less than 8A^/log n from one another. 

Our main result is the following: 

Theorem 2. There exist absolute constants 71 > and c\ > such that if 

^(S n ,k connected) > ra~ 71 

then 

^(S n .k contains a pair of small, close components) = o(n~ Cl ). 

This answers Question 1 in Walters [13|] . Balister has independently obtained a similar 
result (personal communication). 

Remark 1. Theorem 1 of 12] showed that, for n large enough, we need to increase k by at 
most a constant times log(l/e) for S n k to go from having an e chance of being connected 
to having a 1 — e chance of being connected. Assuming there is a 'bluntness' converse to 
this result, i.e. that we need to increase k by at least a (smaller) constant times log(l/e) 
for this transition to occur, then there must be some constant 5 : < 5 < c* such that for 
all k = k(n) with k(n) > (c* — <5)logn we have F(S n! k connected) > n -71 . In particular 
Theorem is not vacuous since there is a range of k for which 

n -7i < P(5 , n fc connected) « 1 - o(n _Cl ). 
(For example, k = [clognj for any c : c* — 5 < c < c* will do.) 

Next we turn to the distribution of the small components of S n ^- Let X = X n ^ denote 
the number of small connected components of S n ^- (Since there is whp a unique non-small 
connected component [3J], X is whp the number of components of S n k minus 1.) Also, 
given v > and A C NU{0}, let Po l ,(A) denote the probability a Poisson random variable 
with parameter v takes a value inside A. 

As an application of Theorem [51 we prove: 
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Theorem 3. There exist absolute constants 72 and C2 > such that if k = k(n) is an 
integer sequence with ¥(S n; k connected) > ra -72 for all n, then, writing v = v(n) for 
— log (P(S , „ j fc connected)), we have 

sup |P(Xe A)-Po v {A)\ =o(n- C2 ). 
ACNU{0} 

Corollary 4. Let k{n) be an integer sequence. Suppose there is a subsequence (fe(raj)) igN 
such that 

T(S ni M n A connected) — > e~ v 

for some constant v > 0. Then the law of X n .^ n ^ converges in distribution to Poisson 
with parameter v: 

^(X n . Mni) ) A Poisson(z^). 

We also prove a spatial analogue of Theorem [3j not only is the number of small com- 
ponents (approximately) Poisson distributed, but their spatial location is (approximately) 
distributed according to a Poisson point process inside S n . We defer a precise statement 
of this result, Theorem I28[ until Section 4. 



1.4 Structure of the paper 

We follow the strategy introduced by Balister, Bollobas, Sarkar and Walters in [4] and 
developed in 0]: we prove Theorem [2] by looking at local events. In Section 2, we prove a 
local result, Theorem [5j which can be thought as an analogue of the key Lemma 7 in Q]. 

Theorem [5] is then used in Section 3 together with local-global correspondence results 
from [7] to prove the global result Theorem [2j 



Finally in the last section we use a form of the Chen-Stein Method [y, [llj due to 
Arratia, Goldstein and Gordon [l| together with our results from the first two sections to 
prove Theorems [3] and I 



2 Proof of the local theorem 

In this section we consider the connectivity of the /c-nearest neighbour random geometric 
graph model on a local scale (i.e. within a region of area O(logn)). 

Pick n sufficiently large. Let M = max(160[A] , 50). (We remark that this is similar 
to but slightly larger than the choice of M in [7].) We consider a Poisson point process of 
intensity 1 in the box 

rr r M A M A ! 2 

We place an undirected edge between every point and its k nearest neighbours to obtain 
the graph U n , k . 

Let us define two families of events related to the connectivity of U n ^: 

Definition 2. Let be the event that U n) k has a connected component wholly contained 
inside the central subsquare \U n . Let be the event that U n ^ has at least two connected 
components wholly contained inside the central subsquare \\J n . 

Our aim is to prove: 
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Theorem 5. There exist constants 03,04 > such that for all integers k with k £ 
(0.3 log n,0.6 log n), we have 



P(Sjfe) < c 3 n- c *F(A k ) + o(re 



-2\ 



This is saying that on a local scale it is far less likely that we have two small connected 
components close together than just one small connected component on its own. Our 
proof strategy is as follows: we show that whp if occurs then there must be a large 
empty region inside U n to which many points can be added without joining up all the 
components of U n: k which are contained inside \U n . This ensures that Ak still occurs for 
the new pointset, and can be exploited to show Ak is much more likely than Bk- 

To prove Theorem [51 we shall need a simple lemma on the concentration of Poisson 
random variables. 

Lemma 6. There exist constants \% and X2, such that for all integers k with k £ (0.3 log n, 0.6 log n), 
the probability that there is any point x £ U n (not necessarily in the pointset arising from 
the Poisson point process) such that the ball of radius Aiv^ogn about x contains at least k 
vertices of U n k or that the ball of radius A2v / l°g n about x contains fewer than k vertices 
ofU ntk is o(n- 2 ). 

Moreover, A 2 can be chosen to be less than A. 

Remark 2. This says that the probability U ri ^ contains a pair of vertices not joined by an 
edge and lying within distance \\\J\og n of each other, or a pair of vertices joined by an 
edge and lying at distance at least \2\f\0gn from one another is o(n~ 2 ), i.e. a negligible 
quantity. 

Proof of Lemma® We first show that there is a constant A2 < A such that the probability 
that there is some point in U n with fewer than 0.6 log n vertices of U n ^ within distance 
A2v / k ) g n of itself is o(re~ 2 ). 

Let r denote the intersection of the integer grid Z 2 with U n . Let X' 2 = 2y / ^ and 
consider any x G T. Since M > 50, at least one of the four standard quadrant quarter 
discs of radius X' 2 \/log n about x is contained inside U n ; call this quarter disc D. The 
probability that D contains fewer than 0.6 log n vertices of U n ^ is at most 

/ TrA' 2 ^ (vrA 2 2 log re/4) - 6 
0.6(logn)exp I — logn 1 



(0.6 log re)! 

/1 % ( 7r ^9 2 /, \ fi \ 1 / e7rA'r, 2 . 

< 0.6(logn)exp -^(log re) + 0.6(log n) log( z 

\ 4 O.b x 4 

< 0.6(log re) exp (— (e 3 — 3) log re) 

< 0.6(logre)re" 



-4 



Thus the probability that V contains any point with fewer than 0.6 log re vertices of U n ^k 
within distance \' 2 ^/\og re is less than |T| x ' 6 ^4 g " = o(re -2 ). Since every point in U n lies 
at distance at most v2 from a point in T, this proves our claim for A2 = A 2 + 1, say. This 
is at most e 2 , which is less than A, as required. 

Now let us show that there is a constant Ai such that the probability that there is 
some point in U n having more than 0.3 log re vertices of U n ^ within distance \\\J\og n of 
itself is o(n -2 ). 
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Let r again denote the intersection of the integer grid 1? with U n . Let A^ 



/cxp(-49/3) 



and consider any x S V. Let D denote the intersection of the disc of radius A^yTogn about 
x with U n . The probability that D contains more than 0.3 log n vertices of U n t is at most 



[0.3 log n] 

[0.3 log n\\ 
(irXf log n) 



exp(-|D|) + 

0.3 log n 



\D 



ro.31ogn]+l 



< 



[0.3 log n] 



([0.3 log n] + 1)! 

A'j 2 log n 



1 ttA? 
1 + ^ + 



exp(-|Z>|)- 



0.3 



< exp I 0.3 log n I log (nX'i log n) — log (0.3 log n/e 



+ ••■ 



TTAf 

0.3 



1 - 7rAf/0.3 



< exp ( 0.3 log n 



-— + log— ) +0(1) 
3 6 0.3 ' 



< exp (-41ogn + Oil)) 
- 3 ). 



on 



Thus the probability that V contains any point with more than 0.3 log n vertices of U n , 



within distance A' x \/log n is less than |T| x exp (— 41ogn + O(l)) 



on 



Now every 



point in U n lies at distance at most \pl from a point in V, so this proves our claim for 
Ai = Ai/2, say. □ 

Proof of Theorem^ Let us assume that there is no point in U n with more than k vertices 
of U n ^ within distance AiyTogn of itself or fewer than k vertices of U n ^ within distance 
\2\/log n of itself. We shall denote by C the set of pointsets we are thus excluding. By 
Lemma El P(C) = o(n~ 2 ). 



We consider a perfect tiling of U n into tiles of area -7^?, 
Explicitly, we shall choose 

N = max(N 1 ,N 2 ,N 3 ), 
where N%, N2 and N3 are the constants 



for some (large) constant N. 



No 



V5/X 



2 
1 



+ 1, 

4ygA2 
A1 2 



and 



—5 (1 + \/5)Ai + A 2 + V ((1 + y/E)\! + A 2 ) - (5 + 2V5)Ai : 



+ 1, 



each of which will appear 

at one of the stages of our argument. The choice oi N > N2 ensures that the inequality 

1/2 



1 / 4^5A 2 1 N 



holds, while the choice of N > N3 ensures that 

1 2A 2 A (l + vg) V 
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holds. (Both inequalities clearly hold for all N large enough, and it is easily checked by 
solving two quadratic equations that the values of N2 and N$ given above will do.) 
Given a pointset V C U n , write U nt k(V) for the fc-nearest neighbour graph on V . 

Definition 3. For each tile Q, let B k (Q) be the event that the pointset V resulting from 
the Poisson process on U n has the following properties: 

(i) V E B k (i.e. U nj k(V) has at least two connected component contained inside \U n ), 

(ii) Q contains no point of V, and 

(hi) for any set of points B C Q, (VU B) £ A k (i.e. U n ^(VUB) has at least one connected 
component contained inside \U n ). 

The key step in the proof of Theorem [5] is to show: 
Theorem 7. 

B k \Cc\jB k (Q). 

Q 

In other words, if B k occurs we can (except in a negligible proportion of cases) find an 
empty tile to which we can add many points and still have A k occurringin the resulting 
pointset. This can be thought of as an analogue of the key Lemma 7 in [7]. 

Proof of Theorem^ Let V be a pointset for which B k \C occurs. Say that a tile is empty 
if it contains no point of V . Let X, Y be the vertex sets of two small connected components 
of U n;k (V) witnessing B k . (At least two such components must exist, though there could 
potentially be more.) 

Let a be a vertex in XUY nearest to the bottom side of U n . Without loss of generality, 
we may assume that a E X. Let E be the horizontal line through a. Then all points of X 
and Y must lie on or above E. Now a lies in some tile, Q a say. 

We consider the tiles directly below Q a . Since N > N% > -jp, the topmost of these 
tiles must be empty. There are two cases to consider. Either all tiles directly below Q a 
are empty, in which case we let Q denote the one among them which is incident with the 
boundary of U n ; or there is some tile directly below Q a , which is nonempty. Then let Q' 
be the topmost of these nonempty tiles, and let Q denote the (empty) tile directly above 
it. 

Lemma 8. 

V E B k {Q). 

Proof. Let B C Q be a nonempty set of points in Q, and let V' = PUB. Our claim is that 
V E A k . To establish this, it is enough to show that there are no edges from Y to Y c in 
U n , k (V) (as then Y will be a connected components of U n ^{T") contained inside \U n ). 

Since the only edges in U n:k (V') that are not also edges of U n!k (V) have at least one 
end in Q, it suffices to show no vertex of Y is joined by an edge of U n ^ k {V) to a point 
b E B. We split into two cases. 

First of all, suppose Q is incident with the boundary of U n . Since V ^ C and A2 < ^ 
(since A2 < A < j^j), we know that no point in Q can have any of its k nearest neighbours 
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Figure 1: The points a, be, d and e and the line E. 

inside \U n and vice- versa. Thus there are no edges from QtoFC \U n and V' £ C 
as required. 

We now turn to the less trivial case where Q is not incident with the boundary of U n . 
Then the tile Q' directly below Q is nonempty: there exists c£VnQ'. 

Let R denote the distance between c and its k th nearest neighbour in V . For any b £ B, 
||c — b\ \ < ^-y/logn. Thus the distance between b and its k th nearest neighbour in V' will 
be at most R + ^\/\og n. Also, ac is not an edge of the /c-nearest neighbour graph on V, 
whence \ \a — c\\ > R and \\a — b\ \ > R — ^ \/log n for all b <E B. 

Now, let 6 G S C Q and suppose d is a point lying above E such that d is one of the k 
nearest neighbours of b in V' . Our earlier choice of N > N2 ensures the following holds: 

Claim 1. 

\\a — d\\ < Aiydogn. 

Remark 3. By our assumption on V (namely our assumption that V ^ C), this implies 
that ad is an edge in U n ^{V). Since a £ X it follows that d^Y . 

Proof of Claim[J[ This is an exercise in Euclidean geometry. (See Figure 4.1.) We know 
that ||6 — d\\ < R + -^ydog n. Let e be the foot of the perpendicular to E which goes 
through b. As b lies in a tile directly below a's tile, we have | \a — e\ \ < -^\/\ogn. It follows 
by Pythagoras's Theorem that 



Now the angle bed is obtuse. Hence, 

d\\ 2 - 

< (iJ+^Tbg^) 2 -(R-^-^nf + ^log7, 



\e-d\\ 2 < \\b - d\\ 2 - \\b - e\\ 2 



Finally, we have 



4V5„ r 1 

-jj-R V lo g n + log n 



| ci — cZ| | < \ \a — e\ \ + ||e — d|| 

< -^\/logn + ( -J^- R V l °g n +^2 lo & n 



Now i? < A 2 \/logn (since V £C). 
Substituting this into the above yields 



which by our choice of N > N2 is less than \\^J\ogn. □ 

On the other hand, suppose d is a point lying above the line E such that b is one of the 
k nearest neighbours of d in V' . Then our earlier choice of N > N3 ensures the following 
holds: 

Claim 2. 

\\a-d\\ < \\b-d\\. 

Remark 4. This implies that a is one of the k nearest neighbours of d in U n} k(V), and 
hence also in Un^iV)- As a G X it follows that d (f:Y . 

Proof of Claim\^ This is again an exercise in Euclidean geometry. (See Figure 4.1.) Since 
b is among the k nearest neighbours of d, it follows that | \b— d\ \ < A2\/l°g n (since d^V and 
V ^ C). Similarly, as ac is not an edge of Un^iV), we must have that ||o — c|| > Ai-v/logn. 
Let e be the foot of the perpendicular to the line E which goes through b. Since the 
triangle bed is obtuse, we have ||e — <i|| < \\b — d\\. As b lies in a tile directly below a's 
tile, we have ||o — e|| < j^yjlog n. Also, as c lies in a tile directly below 6's tile we have 
||6-c|| < ^Vlog^. Thus 

||6 — e|| > \ \a — c\ \ — \ \a — e\ \ — \ \b — c\\ 

~ I 1 N J * gn 

Using again the fact that the triangle bed is obtuse, we have 
\\b-d\\ 2 > ||5-e|| 2 + ||e-(i|| 2 . 



> (x.-^+^l) logn + He-dH 2 (1) 



N 
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Finally we have 



\a-d\\ 2 < (\\a - e\\ + \\e - d\\) 2 



e|| 2 + 2||a-e|| • ||e-d|| + \\e-d\\ 2 



logn 2Vlogn .. ., ||2 

logn 2A 2 logn 2 , n 

-lv^ + ^v^ + l|e " (i|1 ' (2) 

where the last line follows from the fact that ||e — d\\ < ||h — d|| < A 2 \/log n - 

Now our choice of iV (more specifically our choice of N > N$) guarantees that 

1 2A 2 (l + \/5) 2 
^2+^<(Ai — ) 

so that by comparing (1) and (2) we have 

\\a-d\\ < \\b-d\\ 

as claimed. □ 

As remarked Claim [1] tells us that if d is one of 6's k nearest neighbours in V 1 then d 
is one of a's nearest neighbours in V (since V ^ C) — and in particular d £Y. On the 
other hand Claim [2] tells us that if b is one of d's k nearest neighbours in V' then a is one 
of d's k nearest neighbours in V — so that again d ^ Y. 

Combining these two claims, we see that there are no edges between B and Y in 
U nj k(P'), and hence that Y is a connected component of U n ^(V) contained inside \U n . 
We thus have V' G A^, as claimed. □ 

For every V G \ C, we have thus shown there exists a tile Q such that V G Bk(Q), 
proving Theorem [71 □ 

Having established Theorem [7J the rest of the proof of Theorem is straightforward. 
We can consider a Poisson process of intensity 1 on U n as the union of two independent 
Poisson processes on U n \ Q and Q respectively. Call the corresponding random pointsets 
V and B respectively. The event Bk(Q) can then be considered as a product event 

B k (Q) = {V£B k (Q)}x{B = ®}, 

where Bk(Q) is an event depending only on the points inside U n \ Q. 
We then have 

F(B k (Q)) = F(B k (Q))F(B = 0) 

logn 



I (i? fc (Q))exp 



N 2 



<P(4)exp(-^) (3) 

where the last line follows from property (hi) of B k (Q) (which stated that if B k (Q) occurs 
then no matter what points we add to Q, the event A k will occur in the resulting modified 
pointset). 



10 



Now 



P(B k )<F(B k \C)+P(C) 

\jB k (Q) ) +P(C) by Theorem [7] 

Q J 

<£>(fl fc (Q))+P(C) 
Q 

< (MN) 2 P(A k ) exp f- + P(C) by (3) 

= (MJV) 2 n~ 1/Jva P(A fc ) + o(n~ 2 ) by Lemma© 

This concludes the proof of Theorem [5] with C3 = (MN) 2 and 04 = -^. □ 

3 Proof of the global theorem 

In this section we prove our global result, Theorem [2j It will be convenient to prove 
something slightly stronger than Theorem [2] (albeit with a more cumbersome statement), 
namely: 

Theorem 9. There exist constants j[ and > such that for every e : < e < \, all 
n > e -1 / 7 ! and all integers k £ (0.3 log n, 0.6 log n), if 

P(S n ,k connected) > e 

holds, then 

1 



P(S n ,k contains a pair of small, close components) < ^log - j n Cl . 

Proof of Theorem from Theorem The proof of Theorem 5 of establishes that there 
exists a constant 7 > such that for any k < 0.3 log n, the probability that S n)k is 
connected is o(n~ 7 ). Also, Theorem 13 of [3] shows that for k > 0.6 log n, the probability 
that S n>k contains any small component is o(n _ (°' 61og7_1 ^). Thus Theorem[2]is immediate 
from Theorem [9] together with an appropriate choice of the constants 71 and c\. □ 

Proof of Theorem [5] from Theorem Let B be the event that S n>k contains a pair of small, 
close components. We need a few results from JtJ relating local connectivity to global 
connectivity. 

Lemma 10 (Lemma 2 of [7]). For any n and any integer k with 0.3 log n < k < 0.6 log n, 
the probability that U n ^ k contains an edge of length at least Mv2°i n j S 0(n~ G ). 

(Note that our choice of M is slightly larger than in [7J ; however the proof of Lemma 2 
in that paper only used the fact that M > 30, and so holds in the present setting also.) 
To do away with boundary effects, we shall restrict our attention to 'most' of S n . Let 

2 



T n 



: n 



The nice feature of T n is that it is not very close to any of the boundary of S n . The 



following is an easy Corollary of Theorem 1 of 13)] : 
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Lemma 11. There is a positive constant < C5 < 2 such that if k > 0.3 log n then 
the probability that S n ^ contains any small component not wholly contained inside T n is 
0(n~ C5 ). 

As in 0,0], we now define two covers of T n by copies of U n . The independent cover C\ 
of T n is obtained by covering T n with copies of U n with disjoint interiors. The dominating 
cover C 2 of T n is obtained from C\ by replacing each square V E C\ by the 

twenty-five translates V + (i MV f^ -,j MV f^ ), i,j E {0, ±1, ±2}. By construction, we 
have C\ C C 2 and the copies of \U n corresponding to elements of C 2 cover the whole of 

25n 



T n . Also \C 2 \ < MT^ gn 

We shall write l A k occurs in Ci as a convenient shorthand for 'there is a copy V of U n 
in Cj for which the event corresponding to A k occurs', and similarly for B k . We shall need 
the following results from 0]: 

Lemma 12 (Lemma 5 in [7(). For all n E N anc? all integers k with 0.3 log n < k < 
0.6 log n, and C5 as gu>en by Lemma [771 

^('S'n^ connected) = P(^4/t occurs in C 2 ) + 0(n~ Cs ). 

Lemma 13 (Lemma 6 in [7[). There exists a constant 7^ > such that for all e : < 
£ < \, oil integers n > e -1 /^ and all integers k with k E (0.3 log n, 0.6 log n), if 

^{S n ,k connected) > e 

holds then 

m , . , eM 2 logn , /l 

V(A k ) < — log - 

n \e 

Similarly to Lemma [T2l we have 

Lemma 14. For all n E N and all integers k with 0.3 log n < k < 0.6 log n, and C5 as 
given by Lemma [TR 

-- F(B k occurs in C 2 ) + 0{n~° 5 ). 



Proof. Easy modification of the proof of Lemma [12J . □ 

Now, fix e : < e < \. Suppose P(S , n ,fc connected) > e. Provided n > e _1 / 7 i , we have 
by Lemma [TBI that 

m/ , . eM 2 logn , / 1 

F(A k ) < — log - 

n \e 



Now, 



(B) = F(B k occurs in C 2 ) + 0(n~ C5 ) by Lemma [JJ 

M l log n 

16 C o n l ~ C4 ( ( n^ 1 \ \ 

< irJ--i HAk) + O max n~ Cs , by Theorem [5] 

M z log n \ \ log n J J 

< 16ec3?i _C4 log ( -) + (max ( n _C5 , — ) ) by our bound on ¥(A k ) 

\ej v V logny; 

< log f - J n _c 'i 
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for all n > e and sufficiently small choices of j[ > and c' x > 0. This is the claimed 
inequality. □ 

4 The distribution of the small connected components 

In this section, we use Theorems [2] and [5] together with a form of the Chen-Stein Method 
due to Arratia, Goldstein and Gordon [1] to show that the small components in S U: k are 
asymptotically Poisson distributed in a spatial as well as a numerical sense. 

The Arratia, Goldstein and Gordon result essentially tells us that if the presence 
of one small component in a subregion of area O(logn) does not greatly increase the 
chance of having other small components in the same subregion, then the number of small 
components is Poisson distributed (just as we would expect it to be if small components 
were rare events occurring independently at random inside S n ). 

We thus proceed in two stages. First of all we find a good approximation to the 
distribution of the small components of S n ^ using local events. This requires us to use 
Theorem [21 amongst other things. Then we adapt our local result Theorem [5] to show 
that the local events we define are negatively correlated — to be more precise, they are 
independent if sufficiently far apart, and negatively dependent otherwise. This allows us 
to apply Arratia, Goldstein and Gordon to conclude. 

4.1 Local approximation 

We set up some counting functions for the small components. Given a point x E S n , let 
V n (x) be the square of area (4AvTogn) centred at x. 

(Recall that A is the constant given by Definition Q] in the introduction; with probability 
1 — o(n~ 2 ) there are no edges of length greater or equal to X^/]ogn and not more than one 
component of diameter greater than Ay / Iogra in S n> k.) 

Also let V n k(x) be the A;- nearest neighbour graph on the set of points placed inside 
V n (x) by the Poisson point process on S n . 

Definition 4. Let T be the grid T = {x G 1? : V n (x) C S n }. Given x £ T, let the local 
counting function Y{x) = Y n ^(x) be the random variable taking the value 1 if there is a 
connected component H in V n k (x) such that H has diameter less than A ylogn and x is the 
(almost surely unique) member of Z 2 closest to the (almost surely unique) bottom-most 
vertex of H. 

Pick x G r and set 

p = p(n,k) := F(Y(x) = 1). 

Note that F(Y(x) = 1) = F(Y(x') = 1) for all x,x' E T, so that the definition of p is 
independent of x. 

Definition 5. Given x G T, let the global counting function X{x) = X n ^(x) be the 
random variable taking the value 1 if there is a connected component H in S n ^ such that 
H has diameter less than A^log n and x is the (almost surely unique) member of Z 2 closest 
to the (almost surely unique) bottom-most vertex of H . 
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We shall show that whp the small components of S^ft are counted exactly by Y^ x X(x), 
and then that whp X{x) = Y{x) for all x £ T. For this we need some easy lemmas. 

Lemma 15. Suppose S n ,k contains no edge of length greater than \\J\ogn and that x GT 
is such that V nj k(x) contains no edge of length greater than X^log n. Then X(x) = Y{x). 

Proof. This is a minor modification of Lemma 4 of 0] . □ 

Lemma 16. Suppose S n ^ has at most one non-small component, no two small components 
close together and no small component close to the boundary of S n . Then there is (almost 
surely) a one-to-one correspondence between the small components of S n k and the x £ T 
for which X(x) = 1. 

Proof. Almost surely every connected component of S Ui k is counted by at most one X(x). 
Since there are no small components close to the boundary, it follows that every small 
component is counted by at least one X{x). Finally since there are no small components 
close together, every X(x) counts at most one component. □ 

Definition 6. We set 2? to be a collection of bad events. Let T> be the event that any of 
the following occur: 

(i) Sn,k contains an edge of length at least Xy/log n 

(ii) there is some x £ T for which V nt k(x) contains an edge of length at least A-y/log n 

(iii) S nt k contains at least two components of diameter at least Ay / Iogn 

(iv) S n: k contains a small component H such that the point of Z 2 closest to the bottom- 
most vertex of H does not lie in V 

( v ) S nt k contains at least two components of diameter less than X\/log n lying within 
distance less than 8X\/\og n of each other 

(vi) S n k contains a small component H such that there is more than one element of 1? 
closest to a bottom-most vertex of H 

(vii) there is some x £ T for which V n) k{x) contains a small component H such that there 
is more than one element of Z 2 closest to a bottom-most vertex of H. 

Our two previous lemmas have the following corollary: 

Corollary 17. Suppose thatT> does not occur. Then there is a one-to-one correspondence 
between the small components of S n ^ and the x for which Y{x) = 1. 

□ 

How large can this bad set T> be? All but (v) were shown whp not to occur in [7] (up 
to some trivial changes of constants); so all we need to do is apply Theorem [2j 

Lemma 18. There exists a constant cq > such that if k is an integer with k E 
(0.3 log n, 0.6 log n) and¥(S n) k connected) > n _71 ; then 

F{V) = o (riT C6 ) . 



14 



Proof. Provided we choose cq small enough, this is immediate from the properties of A 
given in Definition [1] (properties (i) and (iii)), Lemma [TOl applied n times (property (ii) - 
note we use the fact A2 < A here), Lemma [TT1 (property (iv)), Theorem [2] (property (v)) 
and the fact that almost surely no point of the Poisson process falls on the midpoints of 
two members of V and no two points of the Poisson process fall on the same horizontal 
line (properties (vi) and (vii)). □ 



4.2 Global approximation 

We now study the distribution of Y := YlxeT Y(x). Our aim is to show its law is Poisson- 
like. To achieve this we use the Chen-Stein Method 0, 11] in a form due to Arratia, 
Goldstein and Gordon [l|. 

Informally this says that provided that mutually dependent pairs of random variables 
Y{x) and Y(x') are not likely to both equal 1 and that there are not too many such pairs, 
then the distribution of Y is approximately Poisson. To state Arratia, Goldstein and 
Gordon's theorem precisely, we need some definitions and notation. 

Let x £ r. We can consider a Poisson point process on S n as the union of independent 
Poisson point processes on V n (x) and S n \V n (x). By definition Y(x) is independent of the 
point process on S n \V n (x). 

Set T x to be the set of y £ T for which V n (x) C\V n {y) / 0; this can be thought of as 
the set of possible dependencies for Y(x). 

Define 

h = E E p ( y ( x ) = WPiv) = !) 

6 2 = E E HY(x) = l,Y(y) = l). 

xeTyeT x \{x} 

Now let fj, be the mean of Y, 

/* = EF = Y J Y(x) 

X 

= l r lf> 

and recall from the introduction that Vo^A) is the probability that a Poisson random 
variable with parameter \i takes a value inside the set A. 
Then the following holds: 



Theorem 19 (Arratia, Goldstein and Gordon [l|). 



sup \P(Y £ A) — Po^{A)\ <h + b 2 . 

ACNU{0} 

Thus provided we can obtain good upper bounds on b\ and we will be close to done. 

That bi is small will follow from our assumption that the probability of S n k being 
connected is not too small. Let cs > and 72 > be strictly positive constants chosen 
sufficiently small to satisfy cs < min(l, C4, cq) and 72 < min(7i, respectively. 
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Lemma 20. There is a constant cj > such that if 7 < min (71, cq/2), k £ (0.3 log n, 0.6 log n) 

and ¥(S ny k connected) > n _7 ; £/jen 

c 7 (logn) 2 2 

p < and u < C7(Iogn) . 

n 

Lemma 21. Suppose k E (0.3 log n, 0.6 log n) and 

P(Sn,fc connected) > n~ 72 . 

T/ien 



61 = o(n" 



-c 8 \ 



Proof of Lemma\21\ from LemmaWtK We have |T| < 2n. Also, as y(x) is independent of 
Y{y) for all y with ||x — y\ \ > 4v / 2Ay / logn, we have that |r x | < 256A 2 logn. 
Now suppose F(S Ht k connected) > n~ 72 . We have 

h = p ( y ( x ) = ^(Yiy) = 1) 

xer y & x 

= £|r,b 2 

< 2n(256A 2 logn)p 2 

< 512c 7 2 A 2 ^ logn ^ by LemmaEO] 

n 

= o(n~ Cs ) for cs < 1. 

□ 

We now prove Lemma [201 
Proof of LemmaW(h Suppose that F(S ny k connected) > n~ 7 with 7 < min(7i, ^). Now 

P(S n ,k connected) < P({5„ ifc connected} n V c ) + P(P) 

= F({Y = 0} n V c ) + P(D) by Corollary [T7] 

< p(y = 0) + p(d). 

We have P(S n) fc connected) > n~ 7 by hypothesis and F(V) = o(n _C6 ) 
by Lemma [T51 so that 

P(y = 0) > n" 7 +o(n~ C6 ). 

Now by definition and Y(x') are independent random variables for any x,x' £ T with 
\\x — a/|| > 4\/2A-v/log n. Since there is a set of at least 1000A " logn members of T which are 
4 v / 2AVlogn separated, we have 

P(Y = 0) < (l_p)»*/100QA a logn 
Combining this with the lower bound for F(Y = 0) obtained above, we get 

(1 _p)«/1000A 2 logn > n - 7 + ( n -C6) = „-7 (l + ( n 7-c 6 )) 
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so that, 



p < — log(l — p) 



1000A 2 7(logn) 2 /logn 

n \ n 

cj(logn) 2 



n 

\2 



for some suitable constant cj > (since 7 < ^). In particular, /j = \T\p < C7(logn) . □ 

We now turn our attention to 62 . Here we will need to use a variant of Theorem [5) 

Lemma 22. There exists a constant c' 3 > such that if k £ (0.3 logn, 0.6 logn) and 
are points in F with x' £ F x , then 

P({Y(x) = 1} n {Y(x') = 1}) < c' 3 n" C4 P({y(x) = 1}) + o(n~ 2 ). 

Proof. Lemma [221 does not follow directly from Theorem since instead of working with a 
nice square U n , we begin instead with the union U n of two intersecting squares V n {x) and 
V n {x'). However only a slight modification of our argument in the proof of Theorem [5] is 
needed to establish Lemma 1221 

We consider a translate U' n of U n such that the centre subsquare \U' n contains V n (x) U 
V n (x'). (This is possible since M > IQOXy/Jogn while V n (x),V n (x') have side-length 
4Ai/log n and ||x — x'\\ < 4y / 2Ay / Iogn.) We consider the fe-nearest neighbour graph U' n k 
on the points placed inside U' n by our Poisson process on S n . 

We can define events Aj.{x) and Bi-(x,x') corresponding to {Y(x) = 1} and {Y(x) = 
Y{x') = 1} in a natural way: for y = x, x' let A^{y) be the event that U' nk contains a 
small connected component H such that y is the unique element of 1? closest to a bottom- 
most point of H, and let Bk(x,x') be the event that both of these happen, Bk(x,x') = 
A k {x)r\A k {x'). 

By following exactly the proof of Theorem \5\ we get 

P(B k (x,x')) < c 3 n^F(A k (x)uA k (x')) +o(n- 2 ). 

Now all we have to show is that A k {x) and A k (x') are essentially the same events as 
{Y(x) = 1} and {Y(x') = 1} respectively. This is straightforward from Lemma 4 of 0], 
Lemma [TS] and Lemma [TO], which taken together show 

F(A k (x)) = F(Y(x) = 1) + o(n- 2 ), 
F(A k (x')) = P(Y(x) = 1) + o(n~ 2 ). 

Finally 

P (A k (x) U A k (x')) < F(A k (x)) + nA k (x')) 
so that Lemma [221 follows with c' 3 = 2c%. □ 
We are now able to bound &2 from above: 
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Lemma 23. Suppose k G (0.3 log n, 0.6 log n) and 

P(SVi,fc connected) > n~ 72 . 

Then 

b 2 = o(n" C8 ). 

As cs < min(l,C4) and 72 < min(7i, cq/2), Lemma 1231 comes as a straightforward 
consequence of Lemma [ 



Proof. Suppose that ¥(S nt k connected) > n 7 with 7 < min(7i, ^). Then by Lemma [ 

(logn) 2 



P < c 7 - 



Now |r| < 2n and (as Y{x) is independent of Y(x') for all x' with \\x — x'\\ > 
4^2AVlogn), \T X \ < 256A 2 logn for all x G T. 
Thus 



62 = E E p^) = i,n«) = i) 

zeryer^A-fx} 
< Y,\ r -\ {c 3 n- c *p + o(n- 2 )) 

log n 



< 512A 2 c' 3 (log n)n l ~ c 'ip + 



l-c' A 
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< 512A C3C7 - h o 
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C'4 



o(n~ C8 ) 



by Lemma [22] 

by our bounds on |T|, |T X | 

by our bound on p 
for cs < min(l, C4). 



□ 



Corollary 24. Suppose k G (0.3 logn, 0.6 logn) and F(S n ^ connected) > n 72 . T/ien 

sup |P(Y G A) - Poy,{A)\ = o(n- Cs ). 

ACNU{0} 

Proof. The hypotheses of Lemma I2T1 and Lemma 1231 are satisfied, so that b\ + 02 = o(n~ Cs ) 
Thus by Theorem 1191 



sup |P(Y G A) - Po A1 (A)| < 61 + 6 2 = o(n" 
ACNU{0} 



□ 



4.3 Putting it together 

We now put the various pieces we have together to prove Theorem 

Corollary 1241 tells us that Y is approximately Poisson with parameter n = MY, while 
Corollary [T7] tells us that whp Y counts exactly the small components of S n> k- Our aim 
is to show that the number of small components X of S n & is approximately Poisson with 
parameter v = v{n, k), where 

v{n,k) : = — logP(S' nj fc connected). 
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Thus what we have left to show is that Po M and Po^ are approximately the same 
probability measure. We do this in two stages: first we prove /i and v are almost equal, 
and then use that to show Po^ and Vo v are approximately the same. 

Let C8 > and 72 > be the constants defined in the previous subsection. Recall that 
they satisfy eg < min(l, C4, eg) and 72 < min(7i, ^) respectively. 

Lemma 25. Suppose k G (0.3 log n, 0.6 log n) andP(S nt k connected) > n~ 72 . Then, 

v = fi + o(n- C8 / 2 ). 

Proof. We have 

e~ u = P(S nj k connected) 

= P ({S n ,k connected} nD c ) + P ({S Hjk connected} n V) 
= P {{Y = 0} n V c ) + P ({S n>k connected} n V) 
= P(Y = 0) + o (n~ C6 ) 
= e" M + o (max(n" C8 ,n" C6 )) 
= e-" + o(rT cs ) 

Now e~ v > n~ 72 and 72 < by assumption, so that 

H = - log (e' u + o(n~ C8 )) 
= u-log (l+o(n- {c8 ^ 2) )^ 
= u + o[n-^~^ 
= u + o(n" C8 / 2 ) 

as claimed. □ 

It readily follows that Po M and Po^ are close to being the same measure. 
Corollary 26. If k G (0.3 log n, 0.6 log n) and 

F(S n) k connected) > n~ 72 

both hold, then 

sup \Po v {A) ~ PoM)\ = (n~ C8/2 ) ■ 

AcNU{0} v ' 

Proof. Assume without loss of generality that v > fi. Then we can consider a Poisson 
random variable with mean v as the sum of two independent Poisson random variables 
with means \x and u — fi respectively. Thus for any set A C N U {0}, 

Po u (A) > Po M (A)Po,_ M (0) 

= Po M (A)exp(-(z/-At)) 

= Po M (A) + o(n" Cs/2 ) by Lemma [25j 



by Corollary [T71 
by Lemma fTHI 
by Corollary M 
since cs < cq 



19 



In the other direction, 

Po u (A) < Po M (A)Po^(0) + Po,_ M ([1, oo)) 

= Po^(A) exp(-(i/ - n)) + (1 - exp(-(i/ - //)) 

= Po M (yl) + o(n" Cs/2 ) by Lemma[25j 

Thus for all t4 C N U {0} we have 

|P 0/1 (A)-P 0l/ (A)| =o(n" c «/ 2 ), 

as claimed. 

□ 

Theorem [3] then follows from the triangle inequality and appropriately small choices 
of the constants C2 > and 72 > 0. 

Proof of Theorem^ Let X = X n }~ denote the number of small components in S n ^- As- 
sume F(S nj k connected) > n~ 72 . 

The proof of Theorem 5 of 0] establishes the existence of a constant 7 > such that 
for k < 0.3 log n we have 

lP(<Sn,fc connected) = o(n -7 ). 

Provided 72 < 7 and n is sufficiently large this together with our assumption on the 
connectivity of S n< k guarantees k > 0.3 log n. 

Also Theorem 13 of shows that for k > 0.6 log n the probability that S n f~ contains 
any small component is o (n _ (°' 61og7_1 )) . Thus if C2 < 0.61og7 — 1 = 0.16. . . and k > 
0.6 log n we have 

P(X = 0) = 1 - o{n~ C2 ) 

and 

v = — log (F(S nj k connected)) = o(n _C2 ) 

so that 

Po„({0}) = l-o(n- C2 ) 

and 

Po„((l,oo)) =o(n- C2 ). 

Thus in this case 

sup |P(X G A) - Po„(A)| < \F(X = 0) - Po u ({0})\ + P(X / 0) + Po„((l, 00)) 
AeNu{o} 

= o(n~ C2 ), 

so that the conclusion of Theorem [3] holds in this case. 
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Now supppose k G (0.3 log n, 0.6 log n). 

sup \P(X e A) -Po„(A)\ 
AeNujo} 

< sup \P(X £A)-P(FeA)|+ sup |P(y G A) - Po^(A)| 
AeNu{o} AeNu{o} 

+ sup \P 0fl (A)-Po u {A)\ 

AeNu{o} 

< 2P(£>) + o(n" C8 / 2 ) by Corollary El Corollary [Ml and Corollary [23 
= o(n _C2 ) by Lemma [TE[ provided C2 < min(c6, c$/2). 

Thus picking C2 and 72 to satisfy 

< c 2 < ruin (c 6 , 0.6 log 7-1, ^) 

and 

< 72 < min (7,71, 

we are done. 

□ 

4.4 Process version 

Let us conclude this paper by showing as promised that the locations of the small compo- 
nents are approximately distributed like a Poisson process. 

Let X and Y be the |T [-dimensional vectors X = (X(x)) x ^r and Y = (Y(x)) x£ r 
respectively. We define two Poisson processes on T. 

Let (Z(x)) xe r be a set of |T| independent, identically distributed random variables, 
with Z[x) ~ Poisson(p) for every x. (Recall p = p(n, k) = P(Y(x) = 1).) Set Z to be the 
resulting Poisson process on T, Z = (Z(x)) x& r- 

Similary set p' = v/\T\ and let (Z'(x)) xg r be a set of |T| independent, identically 
distributed random variables, with Z'(x) ~ Poisson(p') for every x. Set Z' to be the 
resulting Poisson process on T, Z' = (Z'(x)) xe r- 

Arratia, Goldstein and Gordon [l| gave the following process version of their Poisson 
approximation theorem: 

Theorem 27 (Arratia, Goldstein and Gordon [1]). Let b\ and 62 be as in Theorem \iy[ 
Then 

sup{|P(Y G A) -P(Z G A)\ : A C (Nu{0}) |r| } <2(6i + 6 2 ). 

We shall write D(Y, Z) for the difference 

D(Y,Z) := sup{|P(Y G A)-P{Z G A)\ : A C (NU{0}) |r| J , 

which is known as the total variation distance between the distributions of Y and Z. 
Let us now apply Theorem [27] to prove a process version of Theorem [3j 

Theorem 28. Suppose ¥(S nj k connected) > n~ 72 . Then 

£>(X,Z') = o(n" C2 ). 
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Proof. Assume ¥(S nt k connected) > n -72 . As in the proof of Theorem [3] we may assume 
0.3 log n < k < 0.6 log n provided n is sufficiently large. 

We know by Corollary 1171 that if T> c does not occur then X = Y. Theorem [27] and the 
bounds on b\ and 62 we obtained in Lemmas 1 2 1 1 and [23l tell us D(Y, Z) is at most o(n _C2 ). 
We know by Lemma [25] that v = // + o(ra _C8 / 2 ). Dividing through by |T| we get that 

p> = p + o(n^ 1+c ^), (5) 

from which it is easy to show that D(Z, Z') = o(n _C2 ): running exactly the same argument 
as in Corollary [26] but with p and p' instead of \i and v, and using (5) instead of Lemma [25] 
we get that 

sup \F(Z(x) G A) - ¥(Z'(x) €A)\ = o(n- 1_C8/2 ) 

ACNU{0} 

for all x G T. Thus 

D{Z, Z') < V sup \¥{Z(x) eA)- P(Z'(x) G ^)| 

= o(|r|n- 1 - cs/2 ) 

= o(n~ C2 ), 

as required. 

Putting it all together, we have 

D(X, Z') < D(X, Y) + D(Y, Z) + D(Z, Z') 
< 2¥(V) + o(n- C2 ) 

= o(n~ C2 ) by Lemma [HI since C2 < eg. 

□ 

What Theorem [25] effectively says is that the location of the small components of S n & 
inside S n is approximately a Poisson point process. Thus the distribution of the small 
components is approximately Poisson in both a numerical and in a spatial sense. 
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